Value function approximation via low-rank models
Abstract
We propose a novel value function approximation technique for Markov decision processes. We consider the problem of compactly representing the state-action value function using a low-rank and sparse matrix model. The problem is to decompose a matrix that encodes the true value function into low-rank and sparse components, which we achieve using Robust Principal Component Analysis (PCA). Under minimal assumptions, this Robust PCA problem can be solved exactly via the Principal Component Pursuit convex optimization problem. We test the procedure on several examples and demonstrate that our method yields approximations essentially identical to the true function.
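The decomposition described in the abstract can be illustrated with a minimal sketch of Principal Component Pursuit, which minimizes the nuclear norm of the low-rank part plus a weighted l1 norm of the sparse part, subject to the two parts summing to the input matrix. The sketch below is not the authors' implementation: the function names, the ADMM-style solver, the default weights `lam` and `mu` (common choices in the Robust PCA literature), and the synthetic matrix standing in for a state-action value table are all assumptions made for illustration.

```python
import numpy as np


def soft_threshold(X, tau):
    """Entrywise shrinkage: the proximal operator of tau * l1 norm."""
    return np.sign(X) * np.maximum(np.abs(X) - tau, 0.0)


def svd_threshold(X, tau):
    """Singular value shrinkage: the proximal operator of tau * nuclear norm."""
    U, s, Vt = np.linalg.svd(X, full_matrices=False)
    return (U * soft_threshold(s, tau)) @ Vt


def principal_component_pursuit(M, tol=1e-7, max_iter=2000):
    """Split M into low-rank L and sparse S with L + S ~= M.

    Alternating-direction sketch for
        minimize ||L||_* + lam * ||S||_1  subject to  L + S = M.
    The values of lam and mu are assumed defaults, not taken from the paper.
    """
    n1, n2 = M.shape
    lam = 1.0 / np.sqrt(max(n1, n2))
    mu = n1 * n2 / (4.0 * np.abs(M).sum())
    S = np.zeros_like(M)
    Y = np.zeros_like(M)  # Lagrange multiplier for the constraint L + S = M
    norm_M = np.linalg.norm(M)
    for _ in range(max_iter):
        L = svd_threshold(M - S + Y / mu, 1.0 / mu)
        S = soft_threshold(M - L + Y / mu, lam / mu)
        residual = M - L - S
        Y = Y + mu * residual
        if np.linalg.norm(residual) <= tol * norm_M:
            break
    return L, S


# Synthetic stand-in for a value matrix: rank-2 structure plus 5% sparse spikes.
rng = np.random.default_rng(0)
n = 60
L0 = rng.standard_normal((n, 2)) @ rng.standard_normal((2, n))
S0 = np.zeros((n, n))
mask = rng.random((n, n)) < 0.05
S0[mask] = rng.choice([-5.0, 5.0], size=int(mask.sum()))
M = L0 + S0

L, S = principal_component_pursuit(M)
rel_err = np.linalg.norm(L - L0) / np.linalg.norm(L0)
print(f"relative error of recovered low-rank part: {rel_err:.2e}")
```

In a value-function setting, the rows and columns of `M` would index states and actions; the random matrix here only demonstrates the mechanics of the decomposition.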
Related resources
Matrix Approximation under Local Low-Rank Assumption
Matrix approximation is a common tool in machine learning for building accurate prediction models for recommendation systems, text mining, and computer vision. A prevalent assumption in constructing matrix approximations is that the partially observed matrix is of low-rank. We propose a new matrix approximation model where we assume instead that the matrix is only locally of low-rank, leading t...
Survey on Probabilistic Models of Low-Rank Matrix Factorizations
Low-rank matrix factorizations such as Principal Component Analysis (PCA), Singular Value Decomposition (SVD) and Non-negative Matrix Factorization (NMF) are a large class of methods for pursuing the low-rank approximation of a given data matrix. The conventional factorization models are based on the assumption that the data matrices are contaminated stochastically by some type of noise. Thus t...
To “Optimization via Low-Rank Approximation, with Applications to Community Detection in Networks”
5.1. Proof of results in Section 3.1. Under degree-corrected block models, let us denote by Ā the conditional expectation of A given the degree parameters θ = (θ1, ..., θn)^T. Note that if θi ≡ 1 then Ā = EA. Since Ā depends on θ, its eigenvalues and eigenvectors may not have a closed form. Nevertheless, we can approximate them using ρi and ūi from Lemma 3. To do so, we need the following lemma.
Regression-aware decompositions
Linear least-squares regression with a “design” matrix A approximates a given matrix B via minimization of the spectral- or Frobenius-norm discrepancy ‖AX − B‖ over every conformingly sized matrix X. Another popular approximation is low-rank approximation via principal component analysis (PCA), which is essentially singular value decomposition (SVD), or interpolative decomposition (ID). Classic...
A Direct Solver for the Advection-diffusion Equation Using Green’s Functions and Low-rank Approximation
A new direct solution method for the advection-diffusion equation is presented. By employing a semi-implicit time discretisation, the equation is rewritten as a heat equation with source terms. The solution is obtained by discretely approximating the integral convolution of the associated Green’s function with advective source terms. The heat equation has an exponentially decaying Green’s funct...
Journal: CoRR
Volume: abs/1509.00061
Publication date: 2015